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(57) Abstract 

. The invention relates to the cloning and characterisation of a novel gene from M tuberculosis, mpt83 (Sequence ID No 2) encoding 
the protein MPT83 (Sequence ID No I). Also disclosed are various substitucnt parts of the protein and gene including the glycosylation 
sequence, lipoylation sequence, secretion sequence and promoter region, and analogues and derivatives (generally 70 % homology) of these. 
The gene arid its various aspects have utility in generating recombinant DNA expression systems for use in the transformation of cells 
e.g. to produce glycosylated or lipoylated products, or which are regulatable e.g. with macrophage factors. The invention also relates to 
recombinant antigens employing MPT83 or using the mpl83 promoter and vaccines based on the above systems. 
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MPT83 gene from Mycobacterium tuberculosis 
TECHNICAL FIELD 

The invention relates to the cloning and characterisation of a 
novel gene encoding a protein expressed in M tuberculosis and its 
5 analogues. The invention further relates to the glycosylation 
sequence, lipoylation sequence, secretion sequence and promoter 
region of the gene. The invention also relates generally to 
derived recombinant DNA expression systems and cells transformed 
thereby, and also recombinant antigens and applications therefor. 

10 BACKGROUND ART 

The mycobacteria are a subgroup of the actinomycetes . The include 
a number of organisms capable of causing serious infection in 
mammals such as man. Notable pathogenic agents belonging to this 
group are Mycobacterium tuberculosis and Mycobacterium bovis. 

15 Identification of novel antigens of these organisms allow improved 
reagents for diagnostic tests or for incorporation into subunit 
vaccines. Additionally such antigens can be manipulated to use as 
serological markers . 

It is known that MPB70 is an immunodominant antigen of M bovis and 
20 is able to elicit a delayed- type hypersensitivity response and to 
stimulate T lymphocyte proliferation and antibody production. 
MPB70 and its homologue in M tuberculosis, MPB70 have been cloned 
and sequenced and the sequences of the two genes are identical. 

Although protein glycosylation is not normally found in bacteria, 
25 Fifis et al, in the paper "Purification and characterisation of 
major antigens from a M bovis culture filtrate", Infections and 
Immunity 59, 800-807, 1991 isolated three forms of MPB70 from M 
bovis which were recognised by a number of monoclonal antibodies 
specific for epitopes of the M bovis antigen MPB70. These antigens 
30 comprised a major 22kDa antigen occurred in two forms and a 25JcDa 
antigen which was shown to be glycosylated. It was also 
demonstrated that carbohydrate was associated with M bovis 
antigens of 38 kDa, 25 kDa, 22 kDa and 19 kDa. More recently a 
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peptide from a 45 kDa antigen in M tuberculosis was also found to 
be 0 -glycosylated through a threonine residue (Dobos et al (1995) 
Infection and Immunity 63:2863-53). 

The 22 kDa and a glycosylated 25 kDa antigens were reported by 
Fifis et al to be the non- glycosylated and glycosylated forms of 
the same protein and encoded by one gene as they were both 
recognised by a number of monoclonal antibodies specific for 
separate epitopes of the M bovis antigen MPB70. 

DISCLOSURE OF THE INVENTION 

The inventors have now developed a monoclonal antibody specific 
for the glycosylated 25 kDa antigen and have used this to 
establish that separate genes are responsible for production of 
the 22 kDa and glycosylated 25kDa antigens in M tuberculosis . 
They have determined a new gene which is distinct from the known 
mpt70 gene and which encodes the glycosylated 25 kDa antigen. The 
inventors have determined that homologues of these genes are 
present in the related species M bovis. Throughout this 
specification the Af tuberculosis gene encoding the 25 kDa 
glycosylated antigen, and the homologous M bovis gene, will be 
referred to as mpt83 and mpb&3 respectively. 

As there is considerable homology between the mpt83 gene (in M 
tuberculosis) and the mp£>83 gene (in M bovis) , the person skilled 
in the art would appreciate that similar glycosylation and/or 
lipoylation of proteins, peptides or other products by recombinant 
techniques may be performed by. genetic manipulation of the mpb 83 
or any other homologous genes in related species. 

The invention makes use of the mpt83 gene in a variety of ways. 
The mpt83 gene has been further characterised such that the 
promoter, glycosylation, lipoylation and secretion signals 
contained therein can be used to provide proteins (in particular 
antigens) , peptides or other products, with a similar 
glycosylation and/or lipoylation to that determined by the mpt83 
gene and to assist secretion from host cells recognising the 
secretion signal. For instance the inpt83 secretion signal may be 
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used according to a further aspect of the invention in assisting 
secretion of recombinant products from host cells by operatively 
splicing the secretion signal to the nucleotide sequence of the 
protein, peptide or product to be secreted from a transformed host 
5 organism recognising the secretion signal . 

The proteins, peptides and products may include antigens which can 
subsequently be used in the development of vaccines with improved 
immunogenic response. 

Thus the invention discloses not only the gene jnpt83, but provides 
10 also the Jnpt83 promoter or a derivative thereof to express any 
protein in particular in species of bacteria, particularly 
Mycobacteria. Also the use of the mpt83 gene or derivatives 
thereof to glycosylate and/or lipoylate and/or secrete peptides, 
proteins or other products in Mycobacteria or indeed any eukaryote 
15 or prokaryote. Methods to manipulate genes to provide the above 
are well known in the art. 

The present invention extends to any and all single or multiple 
nucleotide additions, deletions and/or substitutions to the gene, 
its glycosylation signal and/or the lipoylation and/or secretion 

20 signal and promoter and additionally to homologues of the gene, 
its glycosylation signal and/or the lipoylation and/or secretion 
signal and/or its promoter having being at least 70% identity with 
the authentic sequences contained herein. The present invention 
also extends to natural recombinant and synthetic forms of the 

25 mpt83 gene, its glycosylation, lipoylation and secretion signals 
and promoter. All such variations are referred to herein as 
"derivatives" . 

Homologues of the gene may conveniently be identified and 
isolated, for instance in actinomycetes or related organisms, by 
30 those skilled in the art from a test sample as follows. The test 
sample is contacted with a probe based on the inpt83 gene under 
suitable hybridisation conditions, and any test DMA which 
hybridises thereto is identified. 

Such screening is initially carried out under low- stringency 
35 conditions, which comprise a temperature of about 37*C or less, a 
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formamide concentration .of less than about 50%, and a moderate to 
low salt (e.g. Standard Saline Citrate ('SSC*) = 0.15 M sodium 
chloride; 0.15 M sodium citrate; pH 7) concentration. 
Alternatively, a temperature of about 50*C or less and a high salt 
(e.g. 'SSPE'= 0.180 mM sodium chloride; 9 mM di sodium hydrogen 
phosphate; 9 mM sodium dihydrogen phosphate; 1 mM sodium EDTA; pH 
7.4). Preferably the screening is carried out at about 37"C, a 
formamide concentration of about 20% f and a salt concentration of 
about 5 X SSC, or a temperature of about 50*C and a salt 
concentration of about 2 X SSPE. These conditions will allow the 
identification of sequences which have a substantial degree of 
similarity with the probe sequence, without requiring the perfect 
homology for the identification of a stable hybrid. The phrase 
'substantial similarity' refers to sequences which share at least 
50% overall sequence identity. Preferably, hybridisation 
conditions will be selected which allow the identification of 
sequences having at least 70% sequence identity with the probe, 
while discriminating against sequences which have a lower level of 
sequence identity with respect to the probe. 

After low stringency hybridisation has been used to identify 
several target sequences having a substantial degree of similarity 
with the probe sequence, this subset of nucleotides is then 
subjected to high stringency hybridisation, so as to identify 
those clones having a particularly high level of homology with 
respect to the probe sequences. High stringency conditions 
comprise a temperature of about 42 *C or less, a formamide 
concentration of less than about 20%, and a low salt (SSC) 
concentration. Alternatively they may comprise a temperature of 
about 65*C or less, and a low salt (SSPE) concentration. Preferred 
conditions for such screening comprise a temperature of about 
42 *C, a formamide concentration of about 20%, and a salt 
concentration of about 2 X SSC, or a temperature of about 65"C, 
and a salt concentration of about 0.2 SSPE. 

Specifically the invention provides the gene (mpt83) which encodes 
the glycosylated 25 kDa antigen of Mycobacterium tuberculosis or a 
derivative thereof. 
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The invention also provides the promoter of the mpt83 gene or a 
derivative thereof; the DNA sequence encoding the glycosylation 
signal and/or the lipoylation and/or secretion signal of the mpt83 
gene or derivatives thereof; DNA sequences encoding the 
5 glycosylation signal and/or the lipoylation and/or secretion 
signal of the mpt83 gene or derivatives thereof. When using a 
promoter sequence the invention provides for a DNA molecule 
comprising a promoter according to the invention operably linked 
to a heterologous protein coding sequence; a DNA molecule 
10 comprising a nucleotide sequence encoding the glycosylation signal 
and/or the lipoylation and/or secretion signal according to the 
invention operably linked to a heterologous protein coding 
sequence. 

The present invention also therefore extends to DNA constructs 
15 comprising portions or derivatives of the mpt83 gene and/or its 
glycosylation sequence and/or its lipoylation sequence and/or its 
secretion sequence and/or its promoter region attached to a 
heterologous nucleotide sequence opera tively linked in a 5' to 3' 
direction. 

20 The present invention also extends to organisms transformed with 
such DNA constructs and particularly species of actinomycetes . 

Known techniques enable functional portions of genes to be spliced 
to nucleotides sequences . In this way the DNA encoding the 
glycosylation signal from the jnpt83 gene can be spliced to a gene 
25 sequence encoding a peptide protein or other product such that 
expression of the spliced gene causes the protein to be 
glycosylated, provided the host cell has the machinery for 
recognising such a glycosylation signal and applying the 
glycosylation. 

30 Similarly splicing the secretion signal to a gene sequence 
encoding a protein peptide or other product allows secretion 
• thereof in a recombinant host cell organism. The heterologous 

protein coding sequence may encode an antigenic protein of a 
pathogen. This has particular utility in the application of 

35 vaccines. This may be performed by forming a suitable vector with 
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which to transform a host production organism which can be a 
prokaryote or eukaryote cell. 

Optionally the promoter sequence of the mptB3 gene may be ligated 
such that any protein can be glycosylated and/or lipoylated and 
expressed in mycobacteria, other actinomycetes recognising the 
promoter. Alternatively the spliced nucleotide sequence of a 
protein peptide or other product and the jnpt83 glycosylation 
signal and/or its lipoylation and/or the secretion signal may be 
used with a suitable vector or otherwise to transform any 
prokaryote or eukaryote cells. The use of recombinant DNA vectors 
as tools in such expression systems is well known. The invention 
covers suitable transformation tools such as vectors containing 
the mpt83 gene sequence, and/or the promoter thereof and or the 
glycosylation and or lipoylation signal thereof and/or the 
secretion sequence thereof. Such genes, coding sequences, 
promoters, DNA sequences encoding the glycosylation signal and/or 
the lipoylation and/or secretion signals, DNA molecules and 
vectors may be obtained in isolated form and in substantially pure 
form. 

The invention further extends to prokaryote or eukaryote cells 
transformed the above mentioned DNA constructs, e.g. transformed 
with vectors or by other means e.g. electroporation. This enables 
any peptide, protein or other product to be glycosylated, 
lipoylated. Using the secretion signal enables the recombinant 
product to be excreted outside the host cell when recombinant 
product produced in a suitable host. 

The invention covers any host cell but in particular, for example 
an actinomycete such as a mycobacterium such as M tuberculosis or 
M bovis (e.g. Af bovis BCG) , transformed or transfected with such 
DNA constructs or vectors according to the invention. The 
advantages of using mycobacteria is that the glycosylation and 
lipoylation machinery is likely to be available and the secretion 
signal recognised by other cellular components. Such host cells 
may have utility as vaccine organisms. 
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The manipulation of the \mpt83 gene in a vaccine organism may 
provide a serological marker, for instance allowing 
differentiation between vaccine and wild- type organisms e.g. 
pathogenic organisms, using e.g. monoclonal antibodies. Such 
5 manipulation may involve preventing expression of the mpt83 gene 
product, or causing an epitope -deficient mutant to be expressed. 

The invention also extends to a vaccine comprising a 
pharmaceutically acceptable carrier or diluent and, as active 
ingredient, a vaccine organism according to the invention. 

10 The invention is also particularly useful in providing recombinant 
antigens wherein the protein peptide or other product referred to 
above is an antigen i.e. from a pathogenic organism. The invention 
also extends to a vaccine comprising a pharmaceutically acceptable 
carrier or diluent and such an antigen or an epitopic part 

15 thereof . 

Also a method for vaccinating a patient against infection by a 
pathogen, which method comprises administering to the patient a 
host cell transformed according to the invention with mpt83 DNA 
sequences; or a protein comprising the following elements linked 
20 in a N- terminal to a C- terminal direction: the glycosylation 

signal and/or lipoylation of the MPT83 or a derivative thereof and 
a heterologous antigenic sequence of a pathogen. 

The invention will now be explained with reference to the 
following non- limiting Figures and Examples. Further embodiments 
25 falling within the scope of the invention will occur to those 
skilled in the art in the light of these. 

SEQUENCE LISTINGS 

Sequence ID No 1 is the nucleotide and amino acid sequence of the 
mpt83 gene product MPB83. 

30 Sequence ID No 2 is the mpt83 sequence cloned into pSM13, 
including the promoter region. 
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FIGURES 

Figure 1 is a restriction map of the mpt83 region of the Af 
tuberculosis genome. EcoRV, BamHl, Clal, Pstl, Sad and Sail 
restriction endonucleases sites are shown with a scale of 1Kb 
5 {kilobases), and also a smaller scale of the mptB3 region which 
was cloned into the plasmid pSM13 . 

Figure 2a shows a large portion of the mpt83 promoter sequence 
that was produced during the characterisation of the full gene. 
The first Clal site (bases 10-15 on the Figure) corresponds to the 
10 CJal site at bases 400-405 on Sequence ID No 2 . 

Figure 2b shows a large portion of the mpt83 gene coding region 
that was produced during the characterisation of the full gene. 
The first base ('G') corresponds to base number 171 on Sequence ID 
No 1. 

15 EXAMPLES 

EXAMPLE 1: A DESCRIPTION OF THE MPT83 GENE 

The mpt83 gene is located generally within the region defined by 
the BamHl and Pstl restriction sites of Figure 1, or alternatively 
the region defined by the (same) BamHl site and the EcoRV sites 
20 and may be defined as the open reading frame of these sequences. 

It may be noted that the first Sail restriction site marked in 
Figure 1 and Figure 2a was putatively identified as such on the 
basis that it was cut by Hindi, which is specific inter alia for 
the Sail sequence. The original sequence has since been revised 
25 slightly and this site is marked as Hindi on Sequence ID No 2 . 

The mpt83 region of the M tuberculosis genome may therefore be 
obtained in any suitable manner, such as excision using the above 
mention restriction enzymes. For exa m ple, total genomic DNA may be 
extracted from M tuberculosis and digested to completion using 
30 BamHl and Pstl. The resultant DNA is fractionated. DNA fragments 
of an appropriate size, for example 2 to 3 Kb, are ligated into a 
vector. The ligation mix is used for transformation or 
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electroporation of competent recipient E Coli. Recombinant 
colonies are screened for the presence of the mpt83 region. More 
detail is given in Example 2 below. 

Sequence ID No 2 shows the complete sequence of the mpc8Z gene. 
5 The coding sequence of the gene begins at position 1149 (this is 
position 1 on Sequence ID No 1} . Sequences upstream to this 
comprise the promoter region which initiates transcription of the 
gene. A typical TATAA box is present between positions 1065-1069. 
A selection of restriction endonuclease sites are listed as 

10 features in Sequence ID No 2. Sequence analysis of the gene shows 
a 660 bp open reading frame encoding protein of 220 amino acids 
with a predicted molecular weight of 22 kDa, and a predicted 
isoelectric point of 4.7. A Shine -Delgarno sequence (AAGGAA) is 
present 9 nucleotides upstream of the initiation (ATG) . The N- 

15 terminal protein mpt83 revealed a typical prokaryotic secretion 
signal peptide and the presence of a lipoprotein signal peptide 
cleavage motif between amino acid residues 22-26, Sequence ID No 
1- 

There is 65% homology with mpt70. The fact that it contains a 
20 typical prokaryote lipoprotein signal peptide signal cleavage 
motif suggests that mpt83 is a lipoprotein anchored in the cell 
wall by a thioester linkage of the amino - terminal cysteine residue 
to the fatty acid moiety of the cell membrane. The 2kDa size 
difference between mpt83 expressed in E coli and in both M 
25 smegmatis and M bovis implies that the protein undergoes 

considerable posttranslational modification in mycobacteria. An 
amino acid sequence comparison between mpt83 and nipt70 reveals the 
presence of a proline rich peptide sequence upstream of two 
threonines and two alanines in mp t8 3 which is not present in 
30 mpt70. The gene contains glycosylation, lipoylation and secretion 
sequences which are detailed below. 

EXAMPLE 2: DETAILED ISOLATION OF THE MPT83 REGION OF THE AT 
TUBERCULOSIS; TRANSFORMATION AND EXPRESSION USING THE GENE 

METHOD 1: To isolate the mpt83 region, total genomic DNA is 
35 extracted from M tuberculosis using standard protocols e.g. Z 
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Zaiimddin Dale JW Journal of General Microbiology 135, pp2347 to 
2355, 1989. 10 micrograms of DNA are digested to completion using 
BamHl and Pstl jointly (100 units of each) in commercially 
available universal buffer. The resultant DNA is fractionated by 
agarose gel electrophoresis with TAE buffer and 0.8% low gelling 
temperature agarose. The gel is stained with ethidium bromide. 
The DNA is visualised under UV irradiation. An agarose block 
containing DNA fragments in the region 2 to 3 Kb in size as 
deduced by molecular weight markers is excised and the DNA 
purified by commercially available glass milk systems. Vector DNA 
e.g. pUC18 (1 microgram) is digested to completion with BamHl and 
Pstl (10 units of each) in the universal buffer. The cut vector 
is purified by PAGE and glass milk. Purified vector and insert DNA 
are quantified separately either by absorbent 280/260 nm 
wavelength or by a commercially available dipstick. Equimolar 
amounts of vector and insert DNA are mixed to a volume not 
exceeding 15 microlitres to which is added ligase buffer (5 
microliters x 5 cone.) and 1 microlitre T4 DNA ligase (2.5 units). 
DNA is incubated at 40*C for 16 hours 2.5 microlitres of ligation 
mix is used in standard transformation or electroporation of 
competent recipient E Coli. Selection can be made for 
recombinants on LB agar containing ampicillin 50 micrograms per ml 
with standard blue-white selection using X-dal and IPTG. White 
recombinant colonies are screened for the presence of MPT83 by 
hybridisation or immunoscreening with monoclonal MBS43 specific 
for the gene product (Goodger et al. Vet. Record 135, 82-85, 
1994) . 

METHOD 2: An alternative is to clone the BamHl -EcoRv fragment. 
This may be obtained by suitable restriction enzyme cutting 
according to the manufacturers instructions. Bacteria such as E 
coli strains DH5 and HB101 and M smegmatis strain mc 2 -155 can be 
transformed with the jnpt83 gene using a suitable vector. For 
example a cosmid library may be constructed of M tuberculosis 
genomic DNA in a suitable plasmid/cosmid such as pYUBl8. 
Transformation may be performed by any suitable method in the art 
as above or by electroporation. 
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Expression of the mptB2 protein may also be ascertained by 
immunoblotting of M bovis or M tuberculosis lysates produced by 
e.g. sonication with any suitable specific monoclonal antibody. 
The monoclonal antibody MBS43, for example, recognises the 25kDa 
5 antigen i.e. the MPT 8 3 antigen. Immunoblotting of the culture 
filtrate and cell lysate shows that the 25kDa antigen is 
predominantly associated with the cell pellet; the mptlQ antigen 
is predominant secreted. 

EXAMPLE 3: PREPARATION OF MPTQ3 PROMOTER BY PCR 

10 To amplify the mpt83 promoter region/ specific primers which are 
5 ' GGATCCGTGGTAGGGGATGT - 3 ' (positions 1 to 20, Sequence ID No 2) 
and 5 ' CGCTTCGATTCCTTTGCTTC - 3 ' (complementary positions 1148 to 
1128, Sequence ID No 2) are used with DNA extracted as described 
earlier from Af tuberculosis. The procedure is essentially that 

15 described by Saiki et al (1988) Science 239:487. The buffer is 
titrated to contain the optimum level of magnesium ions and the 
thermal cycling conditions are as follows: the DNA is denatured at 
98C for 5 minutes and followed by 40 cycles at 94°C for 1 minute 
62*C for 1 and a half minutes for annealing and two minutes at 72*C 

20 for extension. A Tag polymerase is used for amplification of the 
promoter region to give a product with a predicted size of 1148 
bp. To identify the product, samples of the PCR are run on an 
agarose gel and Southern blots are hybridised with the Sail 
fragment (positions 641 to 832, Sequence ID No 2) labelled with 

25 any reporter molecule. 

The full-length mpt8 3 promoter is defined within the sequence from 
position 1 to position 1148 shown in Sequence ID No 2 . A part of 
this sequence may however be capable of acting as a promoter and 
thus forms part of the invention. The full-length or part- length 

30 promoter sequence may be obtained by using the polymerase chain 
reaction (PCR) or may be synthesised. A part -length promoter 
sequence therefore may be obtained by removing nucleotides from 
the 5' -terminus or the 3' -terminus of the full-length promoter 
sequence, for example using an exonuclease such as exonuclease III 

35 or BAL 31. A modified promoter sequence may be obtained by 

introducing changes into the full-length or part- length promoter 
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sequence, for example substitutions, insertions and/or extensions. 
This may be achieved by any appropriate technique, including 
restriction of the full-length or part -length sequence with an 
endonuclease, insertion of linkers, use of an exonuclease and/or a 
5 polymerase and site-directed mutagenesis. A promoter according to 
the invention may have a sequence at least 70% identical to the 
sequence of the full-length or part -length promoter. There may be, 
for example up to ten or up to twenty nucleotide deletions, 
insertions and/or substitutions made to the full-length or part- 

10 length promoter sequence. Whether a part -length or modified 
sequence is capable of acting as a promoter may be readily 
ascertained. The candidate promoter sequence is provided in a 
vector upstream of a protein coding sequence e.g. a marker 
promoter. It is provided at a location at which it is believed to 

15 be operatively linked to the protein coding sequence. A suitable 
host is transformed with the resulting vector. The presence or 
absence of expression of the encoded protein is then determined. 

As described in Example 6, a DNA vector comprising the promoter of 
mpt83 gene or its derivatives may be used to transform a host cell 

20 so as to direct the transcription of nucleotide sequences 

operatively linked downstream thereof. Someone skilled in the art 
would readily realise that to facilitate an insertion of a 
nucleotide sequence, encoding a peptide protein or other product, 
into such a vector would require one or more unique restriction 

25 sites down steam of the promoter, so that the vector can be 

cleaved at such sites and the sequence inserted. The nucleotide 
sequence may well be the mpt83 gene itself. The vectors can 
subsequently be utilised to transform actinomycetes or other 
related organisms such as mycobacteria species (e.g. M 

30 tuberculosis or M bovis) , and then use such a transformed organism 
for production of the peptide, protein or product encoded by the 
nucleotide sequence. 

EXAMPLE 4: THE GLYCOSYLATION AND LIPOYLATION SEQUENCES OF MPT 8 3 

The full-length mpt83 glycosylation sequence is defined by the 
35 nucleotide sequence from positions 1251-1319, Sequence ID No 3. 
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The glycosylation signal as represented by the 23 amino acid 
sequence is defined as: 

TS PKPATS PAAPVTTAAMADPAA 

These are amino acids 35-57 on Sequence ID No 1. 

5 To amplify the mpt83 glycosylation region, specific primers which 
are 5 ' -ACCAGCCCGAAACCGGCGA-3 ' (positions 1251 to 1269, Sequence ID 
No 2) and 5 ' - CGCTGCGGGGTCAGCCATT - 3 ' (complementary positions 1319 
to 1301, Sequence ID No 2) are used with DNA extracted as 
described earlier from M tuberculosis. The procedure is 
10 essentially as described in Example 2. To identify the product, 

samples of the PGR products are run on an agarose gel and Southern 
blots are hybridised with a probe having a nucleotide sequence 
corresponding to bases within the primer -defined area e.g. 1270- 
1301 of Figure 2b. 

15 The full-length mpt83 leader plus lipoylation sequence is defined 
by the nucleotide sequence from positions 1149-1244, Sequence ID 
No 2. The secretion/lipoylation signal as represented by amino 
acid sequence is defined as : 

MINVQAKPAAAASLAAIAIAFLAGCSSTKPVS 

20 This is amino acids positions 1-32, Sequence ID No 1. The 
lipoylation motif itself is: 

LAGCS 

This is amino acids positions 22-26, Sequence ID No 1 

Modified glycosylation or lipoylation sequences may be obtained by 
25 introducing changes into the full-length or part -length sequences 
respectively, for example substitutions, insertions and/or 
extensions. This may be achieved by any appropriate technique, 
including restriction of the full-length sequence with an 
endonuclease, insertion of linkers, use of an exonuclease and/or a 
30 polymerase and site -directed mutagenesis. A glycosylation or 

lipoylation sequence according to the invention may be at least 
70% identical to the sequence of the full-length or part -length 
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glycosylation sequence. There may be up to five, for example up to 
ten or up to twenty nucleotide deletions, insertions and/or 
substitutions made to the full-length or part -length promoter 
sequence. Part of these full-length sequences may however be 
5 capable of directing glycosylation lipoylation respectively and 
thus form part of the invention. The full-length or part- length 
glycosylation or lipoylation sequence may be obtained by using the 
polymerase chain reaction (PCR) or may be synthesised. A part- 
length glycosylation or lipoylation sequence therefore may be 
10 obtained by removing nucleotides from the 5' -terminus or the 3'- 
terminus of the full-length sequence, for example using an 
exonuclease such as exonuclease III or BAL 31. 

Use can be made of the glycosylation sequence or lipoylation 
sequence of jnpt B3 (or their derivatives) to respectively 

15 glycosylate or lipoylate any protein, peptide or other product 
from any eukaryote or prokaryote by ligating these sequences to 
nucleotide sequences encoding said peptide, protein or other- 
product and transformation of suitable host organisms therewith. 
Recombinant DNA vectors, for instance as described in Example 6, 

20 are useful tools in this procedure. In essence such a vector is 
constructed wherein the glycosylation or lipoylation signal 
sequence from mpt83 is operably linked to a nucleotide sequence 
encoding the peptide, protein or product to be glycosylated or 
lipoylated as appropriate. In addition such a vector can include 

25 restriction endonuclease sites such that transcription and 

translation of a nucleotide sequence inserted into said sites 
results in a glycosylated or lipoylated transcription and 
translation product of the nucleotide sequence. Whether a part- 
length or modified sequence is capable of causing glycosylation or 

30 lipoylated of an expressed polypeptide may be readily ascertained. 

A promoter operatively linked to the protein coding sequence may v 
also be provided and may include the mpt83 promoter or a 
derivative thereof. The candidate glycosylation or lipoylation 
sequence is provided at a suitable location with respect to the 

35 protein coding sequence. A suitable host is transformed with the 
resulting vector. The presence or absence of glycosylation or 
lipoylation of the expressed protein is then determined. 
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The full-length mpt83 secretion signal glycosylation sequence is 
defined by base positions 1149 to 1211, Figure 2b . Part of these 
full-length sequences may however be capable of directing 
5 secretion and thus form part of the invention. The full-length or 
part -length secretion sequence may be obtained by using the 
polymerase chain reaction (PCR) or may be synthesised. 

The secretion signal as represented by amino acid sequence is 
defined as : 

10 MINVQAKPAAAASLAAIAIAF 

This is amino acids position 1-21, Sequence ID No 1. 

Modified secretion sequence may be obtained by introducing changes 
into the full-length or part- length sequences respectively, for 
example substitutions, insertions and/or extensions. A secretion 
15 sequence according to the invention may be at least 70% identical 
to the sequence of the full-length or part -length glycosylation 
sequence 

Use can be made of the secretion signal in recombinant systems 
whereby the secretion signal is ligated so as to be operably 
20 linked to any nucleotide sequence encoding a protein peptide or 
other product. Transformation of organisms and in particular 
mycobacteria allow the protein peptide or other product to include 
the secretion signal so that the transcribed peptide protein or 
other product is secreted from the bacterial cell wall. 

25 EXAMPLE 6t DNA CONSTRUCTS 

The invention includes DNA constructs such as vectors comprising 
the nucleotide sequence of the mpt83 promoter sequence and/or 
glycosylation sequence and/or the lipoylation sequence and/or the 
secretion sequence (or derivatives of these) operatively linked 
30 either to the nucleotide sequence of the mpt83 translation product 
or the nucleotide sequence encoding of any protein, peptide of 
product of any prokaryote or eukaryote. 
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Recombinant DNA vectors /constructs such as plasmids are therefore 
useful in transforming organisms particularly mycobacteria such 
that they can be made to express any protein peptide or other 
product. If one requires to glycosylate and or lipoylate a 
peptides, protein or other product from any other product the 
vector would include the nucleotide sequence therefore operatively 
linked to the glycosylation sequence and /or lipoylation sequence. 
In essence such a vector is constructed wherein the glycosylation 
or lipoylation signal sequence from mpt83 is operably linked to a 
nucleotide sequence encoding the peptide, protein or product to be 
glycosylated or lipoylated as appropriate. In addition such a 
vector/DNA construct can include restriction endonuclease sites 
that allow transcription and translation of a nucleotide sequence 
inserted into said sites such that a glycosylated or lipoylated 
translation product of the nucleotide sequence is produced. 
Whether a part -length or modified sequence is capable of causing 
glycosylation or lipoylated of an expressed polypeptide may be 
readily ascertained. A promoter operatively linked to the protein 
coding sequence may also be provided; this may be the jnpt83 
promoter or a derivative thereof The candidate glycosylation or 
lipoylation sequence is provided at a suitable location with 
respect to the protein coding sequence. A suitable host can be 
transformed with the resulting vector. The recombinant sequence 
may be inserted behind a strong promoter of the host organism to 
enhance expression. The presence or absence of glycosylation or 
lipoylation of the expressed protein is then determined to 
ascertain that successful transformation has occurred. 

The addition of the secretion signal to the vector enables the 
recombinant product to be secreted from the cell wall when 
expressed in a suitable host such as mycobacteria species. 

Alternatively, other mycobacterial promoters or promoters 
effective in actinomycetes and other species including 
mycobacteria could be cloned into the mpt83 gene or fusion vector 
systems containing segments of the mpt83 gene as described such as 
to initiate the transcription of the sequences of the mpt83 gene 
which may facilitate both translation and secretion when cloned in 
the mpt83 fusion vector. 
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As used in the present specification, a "DNA vector" is defined to 
include plasmid DNA, lysogenic phage DNA and/or transposon DNA, in 
double or single stranded linear or double or single stranded 
circular form and may or may not be self transmissible or 
5 mobilizable. A person skilled in the art will realise that one or 
more origins of replication may be required. At least one origin 
of replication will allow replication in a prokaryotic cell and 
preferably in an actinomycetes or related organism and more 
preferably in a species of mycobacterium. The vector may be 

10 introduced into a cell by any of a number of techniques such as 
conjugation, mobilization, transformation, transf ection, 
transduction or electroporation amongst others. Transformation of 
eukaryotic or prokaryotic cells with the vectors and DNA is by 
introduction of the vector or DNA by any means and is either 

15 integrated into the cells genome or existing extrachromosomally 
(e.g. as an autonomously replicating plasmid) . Linker sites 
comprising one or more restriction enzyme sites can be introduced 
into the gene at any point after the promoter region, by site 
directed mutagenesis, partial cleavage and linker mutagenesis, 

20 transposition 6f restriction sites or other techniques. 

Vectors may be any indigenous plasmid, lysogenic bacteriophage or 
transposon from any of these bacteria which may be used as a 
source of the DNA vector including shuttle vectors. By a shuttle 
vector as used herein is meant a DNA vehicle capable, naturally or 

25 by design, of replication in both the actinomycetes and related 
species and in bacteria such as E coli, Bacillus sp. or 
Pseudomonas sp. and the like and/or eukaryotic cells such as 
mammalian, yeast or fungal cells. In this case, the vectors 
contemplated herein will contain at least one origin of 

30 replication wherein where necessary replication can occur in a 
prokaryotic cell and further origins of replication for 
replication in different prokaryotic species and/or eukaryotic 
species. Additionally, although the present invention is useful 
for any actinomycete or related organisms, it is particularly 

35 directed to any species of M bovis BCG. 

Although the DNA constructs of the present invention are described 
in terms of DNA, it would be readily apparent to one skilled in 
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the art that the constructs could be maintained and in some cases 
used in corresponding RNA form without falling outside the scope 
of the present invention. 

The invention also includes methods of producing recombinant 
products such as proteins peptides or other products by 
transforming any suitable host organism with vectors as describe 
hereinbefore and the recombinant products formed therefrom. 

EXAMPLE 7: THE USE OF MACROPHAGES IN EXPRESSION OF THE MPT83 
ANTIGENS 

The search to date for antigens of M bovis and M tuberculosis for 
use in immunodiagnosis and vaccination has relied on the 
identification of antigens that are expressed when the organism is 
grown on synthetic medium: it is known that virulence determinants 
in some bacterial species may be controlled by environmentally 
responsive regulators, such that they react to phagocytosis by 
macrophages with increased expression of macrophage - induced 
proteins (MIPs) . The inventors have determined that the rapfc83 gene 
is up -regulated in macrophages, that is to say infecting 
macrophages with M bovis BCG strain Pasteur results in enhanced 
production of the glycosylated product. Culturing transformed 
mycobacteria of the MTb complex with macrophages or macrophage 
factors can therefore be used to enhance yields of recombinant 
proteins peptides or other products referred to above. 

The antigenic profiles obtained by Western blotting BCG grown 
inside macrophages against sera from a calf experimentally 
infected with M bovis reveals a number of antigenic differences 
including the up -regulation of antigens of the 25 JcDa form, of 
MPT83 in response to the macrophage environment. The up -regulation 
of the 25 kDa protein for AT tuberculosis grown inside the 
macrophage cell line 1C-21 or when subjected to heat shock at 45 
or 48°C has also been reported by Alavi & Affronti (1994) J 
Leukocyte Biology 55: 633-641. The expression reached its maximal 
level of expression about 7 h post infection. Thus it will be 
apparent to the skilled person that the mpt83 promoter may have 
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utility in directing the up -regulation of expression of 
heterologous genes inside host macrophages. 

EXAMPLE 8: VACCINES 

The heterologous protein coding sequence referred to above which 
may be operatively linked to the mptB3 promoter and/or 
glycosylation and/or lipoylation and/or secretion signals may 
include the heterologous antigenic sequence of a pathogen i.e. may 
encode a sequence of amino acid residues capable of raising 
antibody to epi topic parts of the antigen. According to the 
invention antigenic products from such pathogens can provided with 
glycosylation or lipoylation according to the invention. In this 
way subunit vaccines against pathogens may be provided with 
antigens or epi topic parts thereof having glycosylation and/or 
lipoylation similar to mpfc83. in order to produce such antigens 
mycobacteria and in particular M smegma tis, M vaccae and BCG are 
preferably used as host organisms to produce such recombinant 
antigens. The use of the secretion signal in combination with the 
antigenic sequence allows the recombinant antigens to be secreted 
from the cell and therefore assists production of the antigens. 
The recombinant antigens can then be used in vaccines . 

In addition live recombinant cells incorporating antigenic coding 
sequences as described above may be used in vaccines to induce an 
immune response. The invention further therefore extends to 
recombinant live transformed cells incorporating antigenic coding 
sequences having glycosylated / lipoylated or secretion signals 
from MTP83. The sequence may thus encode an antigen capable of 
raising neutralising antibody, for example an antigen of an 
infectious agent or pathogen such as a virus, bacterium or 
parasite. As examples of viruses whose antigens may be presented 
there may be mentioned hepatitis A virus, hepatitis B virus, 
hepatitis C virus, foot-and-mouth disease virus, poliovirus, 
herpes simplex virus, rabies virus, human immunodeficiency virus 
type 1, (HIV-1), HIV-2, simian immunodeficiency virus (SIV) , human 
rhinovirus (HRV) , dengue virus and yellow fever virus . The 
antigen of a parasite may be a malarial antigen, such as an 
antigen of Plasmodium falciparum. 
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For vaccination against M tuberculosis itself and M bovis or 
related species the active component of a vaccine therefore may 
include so called "naked DNA* . This therefore covers the use of 
portions of the mpt82 gene (or derivative of) for use as vaccine 
components . 

The invention extends to vaccines including recombinant antigens 
as described above, or vaccines containing live cells transformed 
with mpt83 nucleotide sequences. They may be administered in any 
appropriate fashion. Thus, they may be administered orally or 
parenterally, for example intradermally, intravenously or 
intramuscularly as appropriate. They may be formulated with 
appropriate carriers or diluents such as physiological saline. An 
adjuvant such as aluminium oxide may be provided. Appropriate 
dosages may be selected depending upon the particular vaccine to 
15 be administered and the host to which the vaccine is being given. 
It is particularly preferred to administer in this way M bovis BCG 
transformed according to the invention. 

In addition to using the mpt83 gene, its glycosylation and /or its 
lipoylation and/or its secretion signal and/or its promoter 
region, or combinations thereof, in the construction of a variety 
of vector molecules, the present invention contemplates using 
these nucleotide sequences to delete or insert the mpt83 gene, or 
parts thereof, in M bovis, M bovis BCG or any other mycobacterial 
species for the purpose of creating a vaccine inducing a 
25 serological or cell -mediated Immune response with a pathogenic 
mycobacterium. This has important consequences in diagnostic 
research and disease management in that individuals vaccinated 
carrying a deletion mutation in the mpt83 gene, will mount a 
serological and cell -mediated immune response diagnostically 
distinguishable from an individual vaccinated with a strain 
expressing the mpt83 gene. Techniques for inserting genes or 
inducing insertional or deletional mutations are well known in the 
art. 

M bovis BCG is an attractive candidate delivery vehicle for the 
35 development of new multivalent recombinant vaccines since BCG may 
be delivered orally is safe heat stable cheap and has well- 
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documented adjuvant properties. Many viral and protozoan antigens 
are glycosylated and it is possible that antigens requiring 
glycosylation to stimulate a protective response could be 
engineering to contain glycosylation signals recognised by BCG. 

EXAMPLE 9: DIAGNOSTIC TESTING FOR MYCOBACTERIA 

The invention additionally extends to the use of mpt83 gene 
products including recombinant antigens as described hereinabove 
or epi topic parts thereof for use in testing for infection of 
humans and animal with species of mycobacteria and in particular M 
tuberculosis and M bovis. The mp t8 3 gene products may be provided 
either unconjugated or conjugated with suitable reagents and used 
in diagnostic kits to indicate infection when reacted with e.g. 
blood, sera of infected animals. Such techniques are well known in 
the art. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 
(i) APPLICANT: 

(A) NAME: THE MINISTER FOR AGRICULTURE , FISHERIES AND 

. FOOD ETC . 

(B) STREET: WHITEHALL PLACE 

(C) CITY: LONDON 

(E) COUNTRY: UNITED KINGDOM 

(F) POSTAL CODE {ZIP) : SW1A 2HH 

(A) NAME: ROBERT GLYNN HEWINSON 

(B) STREET: CENTRAL VETERINARY LABORATORY 

(C) CITY: ADDLESTONE, NEW HAW 

(D) STATE: SURREY 

(E) COUNTRY: UNITED KINGDOM 

(F) POSTAL CODE (ZIP) : KT15 3NB 

(A) NAME: WILLIAM R JACOBS JR. 

(B) STREET: HOWARD HUGHES MEDICAL INSTITUTE, ALBERT 

EINSTEIN COLLEGE 

(C) CITY: BRONX 

(D) STATE: NEW YORK 

(E) COUNTRY: UNITED STATES OF AMERICA 

(F) POSTAL CODE (ZIP) : NONE 

(ii) TITLE OF INVENTION: RECOMBINANT DNA EXPRESSION SYSTEM 
(iii) NUMBER OF SEQUENCES: 2 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 <EPO) 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 9517494.2 

(B) FILING DATE: 25-AUG-1995 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iii) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

fix) FEATURE: 

(A) NAME/KEY: CDS 
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(B) LOCATION: 1. .663 

(D) OTHER INFORMATION: /product= M MPT83" 

(ix) FEATURE: 

(A) NAME/ KEY: misc_feature 
<B) LOCATION: 1..96 

(D) OTHER INFORMATION: /function= "Lipoylation motif" 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..63 

<D) OTHER INFORMATION: /function= "secretion signal" 

(ix) FEATURE: 

(A) NAME/KEY: misc__f eature 
<B} LOCATION: 103.. 171 

(D) OTHER INFORMATION: /function= "Glycosylation motif" 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

ATG ATC AAC GTT CAG GCC AAA CCG GCC GCA GCA GCG AGC CTC GCA GCC 
Met He Asn Val Gin Ala Lys Pro Ala Ala Ala Ala Ser Leu Ala Ala 
1 5 10 15 



48 



ATC GCG ATT GCG TTC TTA GCG GGT TGT TCG AGC ACC AAA CCC GTG TCG 96 
He Ala He Ala Phe Leu Ala Gly Cys Ser Ser Thr Lys Pro Val Ser 
20 25 30 

CAA GAC ACC AGC CCG AAA CCG GCG ACC AGC CCG GCG GCG CCC GTT ACC 144 
Gin Asp Thr Ser Pro Lys Pro Ala Thr Ser Pro Ala Ala Pro Val Thr 
35 40 45 

ACG GCG GCA ATG GCT GAC CCC GCA GCG GAC CTG ATT GGT CGT GGG TGC 192 
Thr Ala Ala Met Ala Asp Pro Ala Ala Asp Leu He Gly Arg Gly Cys 
50 55 60 

GCG CAA TAC GCG GCG CAA AAT CCC ACC GGT CCC GGA TCG GTG GCC GGA 240 
Ala Gin Tyr Ala Ala Gin Asn Pro Thr Gly Pro Gly Ser Val Ala Gly 
65 70 75 80 

ATG GCG CAA GAC CCG GTC GCT ACC GCG GCT TCC AAC AAC CCG ATG CTC 288 
Met Ala Gin Asp Pro Val Ala Thr Ala Ala Ser Asn Asn Pro Met Leu 
85 90 95 

AGT ACC CTG ACC TCG GCT CTG TCG GGC AAG CTG AAC CCG GAT GTG AAT 336 
Ser Thr Leu Thr Ser Ala Leu Ser Gly Lys Leu Asn Pro Asp Val Asn 
.100 105 110 

CTG GTC GAC ACC CTC AAC GGC GGC GAG TAC ACC GTT TTC GCC CCC ACC 384 
Leu Val Asp Thr Leu Asn Gly Gly Glu Tyr Thr Val Phe Ala Pro Thr 
115 120 125 

AAC GCC GCA TTC GAC AAG CTG CCG GCG GCC ACT ATC GAT CAA CTC AAG 432 
Asn Ala Ala Phe Asp Lys Leu Pro Ala Ala Thr He Asp Gin Leu Lys 
130 135 140 

ACT GAC GCC AAG CTG CTC AGC AGC ATC CTG ACC TAC CAC GTG ATA GCC 480 
Thr Asp Ala Lys Leu Leu Ser Ser He Leu Thr Tyr His Val He Ala 
145 150 155 160 
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£tn HE ^ T n CG AGC AGG ATC GAG GGC ACC CAT CAG ACC CTG CAA 
Gly Gin Ala Ser Pro Ser Arg He Asp Gly Thr His Gin Thr Leu Gin 

170 175 

S f ° ^ Jf 6 GTC ATA 000 GCC CGC ^ ^ CTC ATG GTC AAC 
Gly Ala Asp Leu Thr Val lie Gly Ala Arg Asp Asp Leu Met Val Asn 

180 185 190 

AAC GCC GGT TTG GTA TGT GGC GGA GTT CAC ACC GCC AAC GCG ACG GTG 
Asn Ala Gly Leu Val Cys Gly Gly Val His Thr Ala Asn Ala t£ 
195 200 205 

TAC ATG ATC GAT ACG GTG CTG ATG CCC CCG GCA CAG TA 

Tyr Met lie Asp Thr Val Leu Met Pro Pro Ala Gin 663 
210 215 220 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1898 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(ix) FEATURE: 

(A) NAME/KEY: promoter 

(B) LOCATION: 1..1148 

(ix) FEATURE: 

(A) NAME/KEY: misc_signal 

(B) LOCATION: 1065. .1069 

(D) OTHER INFORMATION: /standard_name= " TATAA BOX" 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1135.. 1140 

(D) OTHER INFORMATION: /standard_name= " SHINE - DELGARNO 
SEQUENCE* 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 400.. 405 

(D) OTHER INFORMATION: /standard_name= "Clal site" 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 641.. 646 

(D) OTHER INFORMATION: /standard_name= "Hindi site" 

(ix) FEATURE: 

(A) NAME/KEY: mi sc_f eature 

(B) LOCATION: 833.. 838 

(D) OTHER INFORMATION: /standard_name= "Sail site" 

(ix) FEATURE: 

(A) NAME/KEY: mis cofeature 

(B) LOCATION: 1488.. 1493 
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(D) OTHER INFORMATION: /standard_name= "Sail site" 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1892.. 1898 

(D) OTHER INFORMATION: /standard_name= -EcorV site" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GGATCCGTGG TAGGGGATGT CGCCGACCCG CTGCGCCGCG TGCTCGTCAG CGACGGCAAT 60 

ATTGGTCACC CCGGTCTGCG CACGTTGTCG CAGCAACGTG TCCAGATGGG CGCCGCCAGC 120 

GGCCAGCCCG ACTACCTCGA AACGGTCCGG ATTGTCGGCG ATGACCTGAA GCGCCTGGGT 180 

GCCGATCGAG CCGGTACTGC CCAGCACCAC CACCCGCAAC CGGCCGTCAG CGCGCCCGTC 240 

GGTCGAGTTG GTCACCTCAT CATTGTGCGC CACCACCTCG TTGTCACCGC GCCGCCGGAT 300 

CACGACGCGT CCACCGGTAG CCACACTTCC CCGTGGAATG CAATCGTCTT GATGCCTGCG 360 

CTTGATGCTA AGATGCCATG CGTGCGCACG ACGATCCGTA TCGATGACGA GCTGTACCGC 420 

GAGGTGAAAG CAAAGGCCGC TCGTTCCGGG CGTACCGTGG CCGCGGTTCT TGAAGATGCG 480 

GTGCGGCGTG GTCTCAACCC GCCTAAGCCG CAGGCCGCCG GCCGTTATCG AGTCCAGCCG 540 

TCGGGTAAGG GCGGCCTGCG GCCCGGTGTC GATCTATCGT CCAACGCCGC ACTTGCCGAA 600 

GCGATGAACG ACGGCGTGTC GGTCGATGCT GTGCGTTGAT GTCAACGTGC TCGTTTACGC 660 

GCATCGGGCA GACCTACGGG AGCACGCGGA CTATCGGGGT TTGCTTGAGC GGCTGGCCAA 720 

CGATGACGAG CCGCTGGGTC TACCAGATAG CGTGCTCGCC GGCTTCATCC GGGTGGTTAC 780 

CAACCGCCGC GTCTTCACCG AGCCGACGAG CCCACAGGAC GCATGGCAGG CAGTCGACGC 840 

CCTACTCGCG GCACCCGCAG CCATGCGACT TCGGCCTGGC GAGCGCCACT GGATGGCCTT 900 

TCGGCAGTTA GCGTCCGATG TTGATGCGAA CGGCAACGAC ATTGCGGACG CGCACCTGGC 960 

CGCCTACGCG CTAGAGAACA ACGCAACCTG GTTGAGCGCC GACCGCGGCT TTGCCCGTTT 1020 

CCGTCGACTG CGCTGGCGTC ATCCGTTGGA CGGTCAGACC CATCTATAAC CGGCCCCACT 1080 

CCGAATCACT GGTGTCCACC CAGGAGGACG GCGTTCAACG CCGCCGCAGA AGCAAAGGAA 1140 

TCGAAGCGAT GATCAACGTT CAGGCCAAAC CGGCCGCAGC AGCGAGCCTC GCAGCCATCG 1200 

CGATTGCGTT CTTAGCGGGT TGTTCGAGCA CCAAACCCGT GTCGCAAGAC ACCAGCCCGA 1260 

AACCGGCGAC CAGCCCGGCG GCGCCCGTTA CCACGGCGGC AATGGCTGAC CCCGCAGCGG 1320 

ACCTGATTGG TCGTGGGTGC GC GCAATACG CGGCGCAAAA TCCCACCGGT CCCGGATCGG 1380 

TGGCCGGAAT GGCGCAAGAC CCGGTCGCTA CCGCGGCTTC CAACAACCCG ATGCTCAGTA 1440 

CCCTGACCTC GGCTCTGTCG GGCAAGCTGA ACCCGGATGT GAATCTGGTC GACACCCTCA 1500 

ACGGCGGCGA GTACACCGTT TTCGCCCCCA CCAACGCCGC ATTCGACAAG CTGCCGGCGG 1560 
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CCACTATCGA TCAACTCAAG ACTGACGCCA AGCTGCTCAG CAGCATCCTG ACCTACCACG 1620 

TGATAGCCGG CCAGGCGAGT CCGAGCAGGA TCGACGGCAC CCATCAGACC CTGCAAGGTG 1680 

CCGACCTGAC GGTGATAGGC GCCCGCGACG ACCTCATGGT CAACAACGCC GGTTTGGTAT 17 40 

GTGGCGGAGT TCACACCGCC AACGCGACGG TGTACATGAT CGATACGGTG CTGATGCCCC 1800 

CGGCACAGTA ACGTTCGGCG CGGTCAAGGC GAGGCAGCCC GTGTAGGCGG TTTGCCTCGC 1860 

TCATCCGGCG GCTTCGTGCC GATAGATCAC GTGATATC 1898 
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CLAIMS 

1. The gene (mpt83) which encodes the 25 kDa antigen of 
Mycobacterium tuberculosis or a derivative thereof. 

2. The nucleotide sequence of the mptB3 gene or a derivative 
thereof . 

3. The promoter of the mpt83 gene or a derivative thereof. 

4. The DMA sequence encoding the glycosylation signal of the 
mpt83 gene or a derivative thereof. 

5 . The DNA sequence encoding the lipoylation signal of the mpt83 
gene or a derivative thereof. 

6. The DNA sequence encoding the secretion signal of the mpt83 
gene or derivative thereof. 

7. A DNA molecule comprising a promoter as claimed in claim 3 
operably linked to a heterologous protein coding sequence. 

8. A DNA molecule comprising a sequence encoding a glycosylation 
signal as claimed in claim 4, or lipoylation signal as claimed in 
claim 5, or secretion signal as claimed in claim 6 operably linked 
to a heterologous protein coding sequence. 

9. A DNA molecule comprising the following elements operably 
linked a 5' to 3' direction: 

a promoter as claimed in claim 3; 

a sequence- encoding a glycosylation signal as claimed in 
claim 4, or lipoylation signal as claimed in claim 5, or secretion 
signal as claimed in claim 6, and'; 

a heterologous protein coding sequence. 

10. A DNA molecule as claimed in any one of claims 7 to 9 wherein 
the heterologous protein coding sequence encodes an antigenic 
protein of a pathogen. 
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11. A vector or DNA construct including the mp tS3 gene promoter 
or a derivative thereof and a sequence encoding the mpt83 
glycosylate signal or a derivative thereof, and one or more 
restriction endonuclease sites downstream of said mptB3 
glycosylation sequence such that transcription and translation of 
a nucleotide sequence inserted into said sites results in a 
glycosylated transcription and translation product of said 
nucleotide sequence. 

12. A vector or DNA construct including the mpt83 gene promoter 
or a derivative thereof and a sequence encoding the mpt83 
lipoylation signal or a derivative thereof and one or more 
restriction endonuclease sites downstream of said mptQl 
lipoylation sequence such that transcription and translation of a 
nucleotide sequence inserted into said sites results in a 
lipoylated transcription and translation product of said 
nucleotide sequence. 

13. A vector or DNA construct including the mpt83 gene promoter 
as claimed in claim 3 and its secretion sequence as claimed in 
claim 6 and one or more restriction endonuclease sites downstream 
of said mpt83 secretion sequence. 

14. A vector or DNA construct including the gene, sequence, 
promoter or DNA molecule as claimed in any one of claims i to 10. 

15. A host cell transformed or transfected with a vector or DNA 
construct as claimed in any one of claims 11 to 14. 

16. A host cell as claimed in claim 15 which is a strain of 
Mycobacterium tuberculosis, Mycobacteria bovis or Mycobacterium 
bovis BCG. 

17. A method of producing a glycosylated or lipoylated protein, 
peptide or other product comprising transforming prokaryote or 
eukaryote cells with a DNA construct or vector as claimed in 
claim 11 or claim 12. 
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18. 



A method' of producing a protein, peptide or other product 
comprising transforming prokaryote or eukaryote cells with a 
recombinant DNA vector as claimed in claim 13 or claim 14. 

19. A method of producing a glycosylated protein as claimed in 
claim 17 wherein said glycosylate is enhanced by. culturing said 
actinomycete within macrophages or by the use of macrophage 
factors. 

20. The recombinantly produced product of the mpt83 gene or a 
derivative thereof, or portions thereof. 

21. A recombinantly glycosylated protein, peptide or other 
product where said glycosylation is substantially similar to the 
glycosylation carried out by the mpt83 gene. 

22. A recombinantly lipoylated protein, peptide or other product 
where said lipoylation is substantially similar to the lipoylation 
carried out by the mpt83 gene. 

23. A protein comprising the following elements linked in a N- 
terminal to a C- terminal direction: 

the glycosylation and or lipoylation signal products of the 
25 kDa antigen of Mycobacterium tuberculosis or a derivative 
thereof ; 

any prokaryote or eukaryote protein. 

24. A protein as claimed in claim 23, wherein said prokaryote or 
eukaryote protein is a heterologous antigenic sequence of a 
pathogen. 

25. A recombinant protein, peptide or other product as claimed in 
any one of claims 20 to 23 wherein said protein, peptide or other 
product is a bacterial, viral or other antigen. 

26. A vaccine including a protein claimed in claims 24 or 25 or 
an antigenic epi topic part thereof. 

27. A vaccine capable of inducing a serological or cell -mediated 
immune response diagnostically indistinguishable from a response 



WO 97/08322 



PCT/GB96/02015 



30 



induced by wild- type infection with a pathogenic mycobacterium 
said vaccine comprising a species of microorganism or vims 
carrying a deletion, insertion or modification of the mpt83 gene, 
or a derivative thereof. 

28. An antibody raised against an antigen as claimed in claim 24 
or claim 25 or an epitopic part thereof. 

29. A vaccine including one or more antibodies as claimed in 
claim 28. 

30. A vaccine including the mpt83 gene sequence, a sequence 
encoding a glycosylation signal as claimed in claim 4, or 
lipoylation signal as claimed in claim 5, or secretion signal as 
claimed in claim 6, each being on its own or recombined. 

31. A vaccine as claimed in claim 30 wherein said sequences are 
portions of the full sequences and sufficient to provoke an immune 
response. 

32. A vaccine comprising a pharmaceutically acceptable carrier or 
diluent and an active ingredient consisting of a host cell as 
claimed in any one of claims 15 or 16. 

33. A host cell according to any one of claims 15 or 16 for use 
as a vaccine. 

34. A host cell as claimed in claim 33 having a native mpt83 gene 
or promoter wherein said gene or promoter has been deleted or 
disrupted such as to render the host cell immunologically 
distinguishable from untransf ormed cells. 

35. A method for vaccinating a patient against infection by a 
pathogen, which method comprising administering to the patient a 
vaccine as claimed in any one of claims 26, 27 or 29 to 32. 

36. A diagnostic test kit for testing infection by mycobacteria 
including the MPT83 protein, or its glycosylation signal, or 
lipoylation signal, or secretion signal or derivatives thereof or 
epitopic portions thereof, comprising an antibody or antisera 
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capable of specifically binding with said protein or signals 
either unconjugated or conjugated with suitable reagent compounds. 
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